Abstract



Recently, King and Ruskai conjectured that the maximal p-norm of the Werner- 
Holevo channel is multiplicative for all 1 < p < 2. In this paper we prove this conjecture. 
Our proof relies on certain convexity and monotonicity properties of the p-norm. 

1 Introduction 


A quantum channel $\Phi$ is described by a completely positive trace-preserving map [?], which
acts on an input density matrix $\rho$ to yield the output $\Phi(\rho)$. Under the effect of noise present in
the channel, pure input states are typically transformed into mixed output states. The amount
of noise present in the channel can be estimated by determining how close the output $\Phi(\rho)$ is
to a pure state when the input $\rho$ is pure. In other words, the output purity provides a yardstick
for the level of noise in the channel. There are various measures of output purity, one of them
being the maximal p-norm of the channel. It is defined as follows:

$$\nu_p(\Phi) := \sup_{\rho} \left\{ \|\Phi(\rho)\|_p \right\}, \tag{1}$$

where $\|\Phi(\rho)\|_p = \left( \mathrm{Tr}\, (\Phi(\rho))^p \right)^{1/p}$ is the p-norm of $\Phi(\rho)$ and $1 \le p \le \infty$. The case $p = \infty$
corresponds to the operator norm. In (1) the supremum is taken over all input density matrices.
However, due to convexity of the map $\rho \mapsto \|\Phi(\rho)\|_p$, it suffices to restrict this supremum to
pure states. It is clear that $\|\Phi(\rho)\|_p \le 1$, since $\Phi(\rho)$ is a density matrix. Equality holds
if and only if the latter is a pure state. Hence $\nu_p(\Phi) \le 1$, with equality if and only if there is
a pure state $\rho$ for which the output $\Phi(\rho)$ is also pure. Thus $\nu_p(\Phi)$ provides a measure of the
maximal purity of outputs from a quantum channel $\Phi$.

The maximal p-norms of two quantum channels $\Phi$ and $\Psi$ are said to be multiplicative if

$$\nu_p(\Phi \otimes \Psi) = \nu_p(\Phi)\, \nu_p(\Psi). \tag{2}$$

This multiplicativity was conjectured by Amosov, Holevo and Werner in [?]. In the limit
$p \to 1$, (2) implies the additivity of another natural measure of the output purity, namely the
von Neumann entropy. The multiplicativity (2) has been proved explicitly for various cases. For
example, it is valid for all integer values of p when $\Phi$ and $\Psi$ are tensor products of depolarizing
channels [?]. It also holds for all $p > 1$ when $\Psi$ is an arbitrary quantum channel and $\Phi$ is
any one of the following: (i) an entanglement breaking channel [?], (ii) a unital qubit channel
[10] or (iii) a depolarizing channel in any dimension [?]. However, it is now known that the
conjecture is not true in general. A counterexample to the conjecture was given in [7], for
$p > 4.79$ in the case in which $\Phi$ and $\Psi$ are Werner-Holevo channels, defined by (3).

The Werner-Holevo channel $\Phi_d$ of dimension $d < \infty$ is defined by its action on any complex
$d \times d$ matrix $\mu$ as follows:

$$\Phi_d(\mu) = \frac{1}{d-1} \left( I\, \mathrm{Tr}(\mu) - \mu^T \right). \tag{3}$$

Here $\mu^T$ denotes the transpose of $\mu$, and $I$ is the $d \times d$ unit matrix. In particular, the action of
the channel on any density matrix $\rho$ is given by

$$\Phi_d(\rho) = \frac{1}{d-1} \left( I - \rho^T \right). \tag{4}$$
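
As a numerical illustration of (3) and (4), the following Python sketch applies the channel to a random pure state and evaluates the p-norm of the output. It assumes NumPy; the helper names `werner_holevo`, `p_norm` and `random_pure_state` are hypothetical and introduced only for this example. For a pure input, (4) gives an output with $d - 1$ eigenvalues equal to $1/(d-1)$ and one zero eigenvalue, so the norm should equal $(d-1)^{(1-p)/p}$; this closed form is a direct consequence of (4) that we use here only as a check.

```python
import numpy as np

def werner_holevo(mu: np.ndarray) -> np.ndarray:
    """Apply Phi_d(mu) = (I Tr(mu) - mu^T) / (d - 1) to a d x d matrix (d >= 2)."""
    d = mu.shape[0]
    return (np.trace(mu) * np.eye(d) - mu.T) / (d - 1)

def p_norm(sigma: np.ndarray, p: float) -> float:
    """p-norm (Tr sigma^p)^(1/p) of a positive semidefinite Hermitian matrix."""
    eigs = np.clip(np.linalg.eigvalsh(sigma), 0.0, None)  # clip rounding noise
    return float(np.sum(eigs ** p) ** (1.0 / p))

def random_pure_state(d: int, rng: np.random.Generator) -> np.ndarray:
    """Projector |psi><psi| onto a random unit vector in C^d."""
    v = rng.normal(size=d) + 1j * rng.normal(size=d)
    v /= np.linalg.norm(v)
    return np.outer(v, v.conj())

rng = np.random.default_rng(0)
d, p = 3, 1.5
rho = random_pure_state(d, rng)
out = werner_holevo(rho)                      # output density matrix, eq. (4)
norm_pure = p_norm(out, p)
closed_form = (d - 1) ** ((1.0 - p) / p)      # derived check value, not from the text
norm_mixed = p_norm(werner_holevo(np.eye(d) / d), p)  # maximally mixed input
```

In this sketch `norm_pure` agrees with `closed_form` for every pure input, while the maximally mixed input gives a strictly smaller norm, in line with the restriction of the supremum in (1) to pure states.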

Werner and Holevo [7] proved that the conjecture (2) was false for $p > 4.79$ when $\Phi = \Psi = \Phi_d$
with $d = 3$. The validity of the conjecture for smaller values of p for this channel was an open
question. Recently it was proved [?, ?] that (2) is true for $p = 2$ for the above channel $\Phi_d$ with
$d \ge 2$. In fact, in [?] the multiplicativity (2) was proved in a more general setting, namely one
in which $\Phi$ is a Werner-Holevo channel but $\Psi$ is an arbitrary channel. Moreover, in [?] the
multiplicativity (2) was conjectured to hold for all 1 < p < 2 for Werner-Holevo channels. This
paper provides a proof of this conjecture.

The precise statement of our result is given in Theorem 1 of Section 2. We would like to note
that while writing up our results, we were made aware of an almost simultaneous but independent
and alternative proof of the conjecture put forth in [?]. This is contained in the recently posted
work [?]. However, not only do we present an alternative approach to the same
conjecture, but this paper also provides the result encapsulated in Lemma 3, which may be
of independent interest.



2 Main result 

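Following the discussion in the Introduction, we write the maximal p-norms for a single Werner-
Holevo channel $\Phi_d$ and the product channel $\Phi_{d_1} \otimes \Phi_{d_2}$ as

$$\nu_p(\Phi_d) = \max_{|\psi\rangle \in \mathcal{H},\ \|\psi\| = 1} \left\{ \|\Phi_d(|\psi\rangle\langle\psi|)\|_p \right\},$$

and

$$\nu_p(\Phi_{d_1} \otimes \Phi_{d_2}) = \max_{|\psi_{12}\rangle \in \mathcal{H}_1 \otimes \mathcal{H}_2,\ \|\psi_{12}\| = 1} \left\{ \|(\Phi_1 \otimes \Phi_2)(|\psi_{12}\rangle\langle\psi_{12}|)\|_p \right\}, \tag{5}$$

respectively. In the above, $\mathcal{H} \simeq \mathbb{C}^d$ and $\mathcal{H}_i \simeq \mathbb{C}^{d_i}$ for $i = 1, 2$, and $\Phi_i$ is shorthand for $\Phi_{d_i}$. Our main result is stated in the
following theorem.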

Theorem 1 Let $\Phi_d$ denote a Werner-Holevo channel of dimension d. Then the multiplicativity
of the maximal p-norms

$$\nu_p(\Phi_{d_1} \otimes \Phi_{d_2}) = \nu_p(\Phi_{d_1})\, \nu_p(\Phi_{d_2}) \tag{6}$$

holds for all $1 \le p \le 2$, for arbitrary dimensions $d_1, d_2 \ge 2$.

To prove Theorem 1 we will make use of the method developed in [?] and of certain results
proved in it. It is useful to consider the Schmidt decomposition of $|\psi_{12}\rangle$:

$$|\psi_{12}\rangle = \sum_{\alpha=1}^{d} \sqrt{\lambda_\alpha}\, |\alpha; 1\rangle \otimes |\alpha; 2\rangle. \tag{7}$$

Here $d = \min[d_1, d_2]$ and $\{|\alpha; j\rangle\}$ is an orthonormal basis in $\mathcal{H}_j$, $j = 1, 2$. The Schmidt
coefficients $\lambda_\alpha$, $\alpha = 1, 2, \ldots, d$, and hence also the vector of Schmidt coefficients $\lambda := (\lambda_1, \ldots, \lambda_d)$,
vary in the $(d-1)$-dimensional simplex $\Sigma_d$, defined by the constraints

$$\lambda_\alpha \ge 0; \qquad \sum_{\alpha=1}^{d} \lambda_\alpha = 1. \tag{8}$$

Note that the vertices of $\Sigma_d$ correspond to unentangled vectors $|\psi_{12}\rangle = |\psi_1\rangle \otimes |\psi_2\rangle$. To prove
Theorem 1 it is sufficient to show that the maximum on the RHS of (5) is achieved for unentangled
vectors. In other words, we need to prove that this maximum occurs at the vertices of
the simplex $\Sigma_d$.
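
The Schmidt coefficients in (7) can be computed numerically from a singular value decomposition. The sketch below assumes NumPy and uses a hypothetical helper `schmidt_coefficients`; it relies on the standard fact that, if the amplitudes of $|\psi_{12}\rangle$ are arranged as a $d_1 \times d_2$ matrix, the $\lambda_\alpha$ are the squared singular values of that matrix.

```python
import numpy as np

def schmidt_coefficients(psi: np.ndarray, d1: int, d2: int) -> np.ndarray:
    """Schmidt coefficients lambda_alpha of a unit vector psi in C^d1 (x) C^d2."""
    amplitudes = psi.reshape(d1, d2)          # psi = sum_ij A_ij |i>|j>
    singular_values = np.linalg.svd(amplitudes, compute_uv=False)
    return singular_values ** 2               # min(d1, d2) values, sorted decreasingly

rng = np.random.default_rng(1)
d1, d2 = 3, 4

# A random (generically entangled) unit vector.
psi = rng.normal(size=d1 * d2) + 1j * rng.normal(size=d1 * d2)
psi /= np.linalg.norm(psi)
lam = schmidt_coefficients(psi, d1, d2)

# An unentangled vector |psi_1> (x) |psi_2> sits at a vertex of the simplex.
product = np.kron([1.0, 0.0, 0.0], [0.0, 1.0, 0.0, 0.0])
lam_product = schmidt_coefficients(product, d1, d2)
```

Here `lam` is a point of the simplex defined by (8), and `lam_product` is the vertex $(1, 0, 0)$.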

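Using the Schmidt decomposition (7), the input to the product channel can be expressed as

$$|\psi_{12}\rangle\langle\psi_{12}| = \sum_{\alpha,\beta=1}^{d} \sqrt{\lambda_\alpha \lambda_\beta}\, |\alpha; 1\rangle\langle\beta; 1| \otimes |\alpha; 2\rangle\langle\beta; 2|. \tag{9}$$

The output of the channel is the density matrix given by

$$\sigma_{12}(\lambda) := (\Phi_1 \otimes \Phi_2)(|\psi_{12}\rangle\langle\psi_{12}|) = \sum_{\alpha,\beta=1}^{d} \sqrt{\lambda_\alpha \lambda_\beta}\, \Phi_1(|\alpha; 1\rangle\langle\beta; 1|) \otimes \Phi_2(|\alpha; 2\rangle\langle\beta; 2|). \tag{10}$$

We prove Theorem 1 by showing that

$$[\nu_p(\Phi_{d_1} \otimes \Phi_{d_2})]^p = [\nu_p(\Phi_{d_1})]^p\, [\nu_p(\Phi_{d_2})]^p \quad \text{for any } d_1, d_2 \ge 2 \text{ and } 1 \le p \le 2.$$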






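The multiplicativity (6) holds trivially for $p = 1$ since $\mathrm{Tr}\, \rho = 1$ for any density matrix $\rho$. Note
that

$$[\nu_p(\Phi_{d_1} \otimes \Phi_{d_2})]^p = \max_{\|\psi_{12}\| = 1} \left\{ \|(\Phi_1 \otimes \Phi_2)(|\psi_{12}\rangle\langle\psi_{12}|)\|_p^p \right\} = \max_{\lambda \in \Sigma_d} \sum_{i=1}^{d_1 d_2} (\xi_i(\lambda))^p, \tag{11}$$

where $\{\xi_i(\lambda),\ i = 1, 2, \ldots, d_1 d_2\}$ denotes the set of eigenvalues of the channel output $\sigma_{12}(\lambda)$.
These eigenvalues were studied in detail in [?] and were found to be divided into the three
classes given below. Here we assume for definiteness that $d_1 \le d_2$, so that $d = d_1$.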

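1. There are $d(d-1)$ eigenvalues given by

$$e_{\alpha\beta} := \frac{1}{(d_1 - 1)(d_2 - 1)}\, (1 - \lambda_\alpha - \lambda_\beta), \quad \alpha \ne \beta, \quad \alpha, \beta = 1, 2, \ldots, d.$$

2. There are d eigenvalues given by

$$h_\alpha := \frac{1 - \lambda_\alpha}{(d_1 - 1)(d_2 - 1)}, \quad \alpha = 1, 2, \ldots, d,$$

each of multiplicity $d_2 - d_1$.

3. There are d eigenvalues of the form

$$g_\alpha := \frac{\gamma_\alpha}{(d_1 - 1)(d_2 - 1)}, \quad \alpha = 1, 2, \ldots, d, \tag{12}$$

where $\gamma_\alpha$ are the roots of the equation

$$\prod_{\alpha=1}^{d} (1 - 2\lambda_\alpha - \gamma) \left[ 1 + \sum_{\alpha'=1}^{d} \frac{\lambda_{\alpha'}}{1 - 2\lambda_{\alpha'} - \gamma} \right] = 0. \tag{13}$$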
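
The three classes of eigenvalues can be checked numerically against a direct construction of the channel output. The following sketch assumes NumPy and uses hypothetical helper names (`product_channel_output`, `predicted_spectrum`). Two assumptions are made and should be read as ours rather than as part of the cited analysis: the product channel is applied through the identity $(\Phi_1 \otimes \Phi_2)(X) = [\mathrm{Tr}(X)\, I \otimes I - (\mathrm{Tr}_2 X)^T \otimes I - I \otimes (\mathrm{Tr}_1 X)^T + X^T] / ((d_1-1)(d_2-1))$, which follows from (3) by linearity, and the roots $\gamma_\alpha$ of (13) are obtained as the eigenvalues of the rank-one update $\mathrm{diag}(1 - 2\lambda_\alpha) + |\sqrt{\lambda}\rangle\langle\sqrt{\lambda}|$, whose characteristic equation is (13) by the matrix determinant lemma.

```python
import numpy as np

def product_channel_output(psi, d1, d2):
    """(Phi_d1 (x) Phi_d2)(|psi><psi|), using Phi_d(mu) = (I Tr(mu) - mu^T)/(d - 1)."""
    X = np.outer(psi, psi.conj())
    X4 = X.reshape(d1, d2, d1, d2)
    rho1 = np.einsum('ijkj->ik', X4)          # partial trace over system 2
    rho2 = np.einsum('ijil->jl', X4)          # partial trace over system 1
    I1, I2 = np.eye(d1), np.eye(d2)
    total = (np.trace(X) * np.kron(I1, I2) - np.kron(rho1.T, I2)
             - np.kron(I1, rho2.T) + X.T)
    return total / ((d1 - 1) * (d2 - 1))

def predicted_spectrum(lam, d1, d2):
    """Sorted eigenvalues from classes 1-3 (assumes d1 <= d2 and len(lam) == d1)."""
    d, norm = len(lam), (d1 - 1) * (d2 - 1)
    e = [(1 - lam[a] - lam[b]) / norm for a in range(d) for b in range(d) if a != b]
    h = np.repeat((1 - lam) / norm, d2 - d1)  # each h_alpha has multiplicity d2 - d1
    root = np.sqrt(lam)
    # Assumption: roots of (13) = eigenvalues of diag(1 - 2 lam) + |sqrt(lam)><sqrt(lam)|.
    gamma = np.linalg.eigvalsh(np.diag(1 - 2 * lam) + np.outer(root, root))
    return np.sort(np.concatenate([e, h, gamma / norm]))

rng = np.random.default_rng(2)
d1, d2 = 3, 4
lam = rng.dirichlet(np.ones(d1))              # random point of the simplex (8)
psi = np.zeros(d1 * d2)
for a in range(d1):
    psi[a * d2 + a] = np.sqrt(lam[a])         # |a;1> (x) |a;2> in the standard basis

sigma = product_channel_output(psi, d1, d2)
actual = np.sort(np.linalg.eigvalsh(sigma))
predicted = predicted_spectrum(lam, d1, d2)
```

The counts add up as expected: $d(d-1) + d(d_2 - d_1) + d = d_1 d_2$ eigenvalues in total, and `actual` and `predicted` agree up to rounding.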



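Using the constraint (8) we find that

$$\sum_{\substack{1 \le \alpha, \beta \le d \\ \alpha \ne \beta}} e_{\alpha\beta} = \frac{d_1 - 2}{d_2 - 1}, \qquad \sum_{\alpha=1}^{d} h_\alpha = \frac{1}{d_2 - 1}, \tag{14}$$

and using the fact that the sum of all eigenvalues of $\sigma_{12}(\lambda)$ is equal to 1 we get

$$\sum_{\alpha=1}^{d} g_\alpha = \frac{1}{d_2 - 1}. \tag{15}$$

These relations allow us to define the following sets of non-negative variables

$$\tilde e_{\alpha\beta} := \frac{d_2 - 1}{d_1 - 2}\, e_{\alpha\beta}, \quad \alpha, \beta = 1, 2, \ldots, d;\ \alpha \ne \beta, \qquad \tilde h_\alpha := (d_2 - 1)\, h_\alpha, \qquad \tilde g_\alpha := (d_2 - 1)\, g_\alpha, \tag{16}$$

each of which is normalized:

$$\sum_{\substack{1 \le \alpha, \beta \le d \\ \alpha \ne \beta}} \tilde e_{\alpha\beta} = \sum_{\alpha=1}^{d} \tilde h_\alpha = \sum_{\alpha=1}^{d} \tilde g_\alpha = 1.$$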



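Hence, we can write

$$\|\sigma_{12}(\lambda)\|_p^p = \sum_{i=1}^{d_1 d_2} (\xi_i(\lambda))^p = c_1 \sum_{\alpha \ne \beta} \tilde e_{\alpha\beta}^{\,p} + c_2 \sum_{\alpha=1}^{d} \tilde h_\alpha^{\,p} + c_3 \sum_{\alpha=1}^{d} \tilde g_\alpha^{\,p} =: T_1(\lambda) + T_2(\lambda) + T_3(\lambda), \tag{17}$$

where $c_1$, $c_2$ and $c_3$ are constants depending on the dimensions $d_1$ and $d_2$, and $\tilde e_{\alpha\beta}$, $\tilde h_\alpha$, $\tilde g_\alpha$ are the normalized variables of (16).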

The function $f(x) := x^p$, where $1 \le p \le 2$, is convex for $x \ge 0$. Hence, $T_1(\lambda)$ is a convex function
of the variables $\tilde e_{\alpha\beta}$. These variables are affine functions of the Schmidt coefficients $\lambda_1, \ldots, \lambda_d$.
Hence, $T_1(\lambda)$ is a convex function of $\lambda$ and attains its global maximum at the vertices of the
simplex $\Sigma_d$. The same argument applies to $T_2(\lambda)$ since the variables $\tilde h_\alpha$ are also affine functions
of the Schmidt coefficients. The function $T_3(\lambda)$ is however not necessarily a convex function of
$\lambda$. In spite of this, it too achieves its maximum value at the vertices of $\Sigma_d$. This follows from
the following theorem.

Theorem 2 The function $T_3(\lambda)$ is Schur-convex in $\lambda \in \Sigma_d$, i.e., $\lambda \prec \lambda' \implies T_3(\lambda) \le T_3(\lambda')$,
where $\prec$ denotes the stochastic majorization (see the Appendix).

Since every $\lambda \in \Sigma_d$ is majorized by the vertices of $\Sigma_d$, Theorem 2 implies that $T_3(\lambda)$ also
attains its maximum at the vertices. Thus $\|\sigma_{12}(\lambda)\|_p^p = T_1(\lambda) + T_2(\lambda) + T_3(\lambda)$ is maximized at
the vertices of $\Sigma_d$. As was observed, this implies the multiplicativity (6).
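
The conclusion that $\|\sigma_{12}(\lambda)\|_p^p$ is maximized at the vertices can be probed numerically by random sampling of the simplex. The sketch below assumes NumPy; the helpers `output_spectrum` and `norm_p_power` are hypothetical, the spectrum is assembled from the three eigenvalue classes of this section, and the roots $\gamma_\alpha$ are again taken (as our assumption) to be the eigenvalues of $\mathrm{diag}(1 - 2\lambda_\alpha) + |\sqrt{\lambda}\rangle\langle\sqrt{\lambda}|$. Sampling is only a sanity check, not a substitute for the proof.

```python
import numpy as np

def output_spectrum(lam, d1, d2):
    """Eigenvalues of sigma_12(lambda) from the three classes (assumes d1 <= d2)."""
    lam = np.asarray(lam, dtype=float)
    d, norm = len(lam), (d1 - 1) * (d2 - 1)
    e = [(1 - lam[a] - lam[b]) / norm for a in range(d) for b in range(d) if a != b]
    h = np.repeat((1 - lam) / norm, d2 - d1)
    root = np.sqrt(lam)
    # Assumption: roots of (13) = eigenvalues of this rank-one update.
    gamma = np.linalg.eigvalsh(np.diag(1 - 2 * lam) + np.outer(root, root))
    return np.concatenate([e, h, gamma / norm])

def norm_p_power(lam, d1, d2, p):
    """p-th power of the p-norm of sigma_12(lambda), i.e. the sum in (11)."""
    eigs = np.clip(output_spectrum(lam, d1, d2), 0.0, None)  # clip rounding noise
    return float(np.sum(eigs ** p))

rng = np.random.default_rng(3)
d1, d2 = 3, 4
vertex = np.array([1.0, 0.0, 0.0])            # unentangled input
gaps = []
for p in (1.1, 1.5, 2.0):
    at_vertex = norm_p_power(vertex, d1, d2, p)
    sampled = max(norm_p_power(rng.dirichlet(np.ones(d1)), d1, d2, p)
                  for _ in range(2000))
    gaps.append(at_vertex - sampled)          # should be non-negative

vertex_value = norm_p_power(vertex, d1, d2, 1.5)
```

At a vertex the output has $(d_1 - 1)(d_2 - 1)$ equal non-zero eigenvalues, so the vertex value is $((d_1 - 1)(d_2 - 1))^{1-p}$, which is the product of the single-channel values $(d_i - 1)^{1-p}$.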

To prove Theorem 2 we use the following lemma, which is proved in Section 3.

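Lemma 3 Let $f(x) := \sum_{i=1}^{n} x_i^p$, where $x = (x_1, x_2, \ldots, x_n)$ with each $x_i \ge 0$ and $\sum_{i=1}^{n} x_i = 1$.
For $1 < p < 2$, $f(x)$ is a monotonically non-increasing function of the elementary symmetric
polynomials $s_q(x_1, x_2, \ldots, x_n)$ for $2 \le q \le n$, where

$$s_0 := 1, \qquad s_k(x_1, x_2, \ldots, x_n) := \sum_{1 \le i_1 < i_2 < \cdots < i_k \le n} x_{i_1} x_{i_2} \cdots x_{i_k} \tag{18}$$

for $k = 1, 2, 3, \ldots, n$.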
Note that $T_3(\lambda) := c_3 \sum_{\alpha=1}^{d} \tilde g_\alpha^{\,p}$, where

$$\tilde g_\alpha \ge 0 \quad \text{and} \quad \sum_{\alpha=1}^{d} \tilde g_\alpha = 1.$$

The variables $\tilde g_\alpha$ are proportional to the roots $\gamma_\alpha$ of eq. (13) (see (12) and (16)), which are
obviously functions of the Schmidt vector $\lambda$. Hence, by Lemma 3, $T_3(\lambda)$ is a monotonically
non-increasing function of the elementary symmetric polynomials

$$s_k(\lambda) := s_k(\gamma_1, \gamma_2, \ldots, \gamma_d), \quad k = 0, \ldots, d. \tag{19}$$

Therefore, to prove Theorem 2 it suffices to show that the functions $s_k(\lambda)$ are Schur concave
in $\lambda \in \Sigma_d$. Indeed, if $\lambda \prec \lambda'$ then $s_k(\lambda) \ge s_k(\lambda')$, and since $T_3$ is non-increasing in each $s_k$
it follows that $T_3(\lambda) \le T_3(\lambda')$. This Schur-concavity property of $s_k(\lambda)$ was proved explicitly in [?]. The proof of
Lemma 3 therefore allows us to establish Theorem 2 and hence Theorem 1, our main result.
This is given in Section 3. Our proof is analogous to that of Theorem 1 of [?].



3 Proof of Lemma 3

The variables $x_1, x_2, \ldots, x_n$, defined in Lemma 3, can be viewed as the eigenvalues of an $n \times n$
density matrix $\rho_n$ (say), and hence as the roots of the characteristic equation $\det(\rho_n - x \mathbf{1}) = 0$.
Since the roots are the zeros of the product $\prod_{i=1}^{n} (x - x_i)$, the characteristic equation can be
expressed in terms of these roots as follows:

$$\sum_{k=0}^{n} x^k\, (-1)^{n-k}\, s_{n-k}(x_1, x_2, \ldots, x_n) = 0. \tag{20}$$

Here the coefficient $s_{n-k}(x_1, x_2, \ldots, x_n)$ denotes the $(n-k)$th elementary symmetric polynomial
of the variables $x_1, x_2, \ldots, x_n$ (defined by (18)). We consider equation (20) to implicitly define
the variables $x_j = x_j(s_1, s_2, \ldots, s_n)$ as functions of the elementary symmetric polynomials.
This can be done unambiguously as long as there are no multiple roots, i.e. $x_i \ne x_j$ for $i \ne j$,
$i, j = 1, 2, \ldots, n$. We restrict our attention to this case at first, and prove that in the absence
of multiple roots, $\partial f / \partial s_i \le 0$ for each $i \ge 2$. This will enable us, by continuity arguments, to
conclude that $f$ is indeed a monotonically non-increasing function of the elementary symmetric
polynomials $s_2, s_3, \ldots, s_n$ everywhere.

Let us first prove that $\partial f / \partial s_i \le 0$ when the roots $x_1, x_2, \ldots, x_n$ are all different. In this case
we can view the variables $x_j$ to be implicitly defined by (20). Then differentiating with respect
to $s_m$, for $2 \le m \le n$, we get

$$\frac{\partial x_j}{\partial s_m} = \frac{(-1)^{m+1}\, x_j^{\,n-m}}{\prod_{i \ne j} (x_j - x_i)}. \tag{21}$$
Using the chain rule and the definition of the function $f(x)$ we get

$$\frac{\partial f}{\partial s_m} = \sum_{j=1}^{n} \frac{\partial f}{\partial x_j} \frac{\partial x_j}{\partial s_m} = (-1)^{m+1}\, p \sum_{j=1}^{n} \frac{x_j^{\,p+n-m-1}}{\prod_{i \ne j} (x_j - x_i)}. \tag{22}$$
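
Formula (22) can be compared with a finite-difference derivative. In the sketch below (assuming NumPy; `f_from_coeffs` and `df_dsm_formula` are hypothetical helper names, and the sample point and step size are arbitrary choices) the elementary symmetric polynomials are read off from the coefficients of $\prod_i (x - x_i)$, one $s_m$ is perturbed while the others are held fixed, and the roots are recomputed.

```python
import numpy as np

def f_from_coeffs(coeffs, p):
    """f = sum_j x_j^p, where x_j are the (real, positive) roots of the monic polynomial."""
    roots = np.roots(coeffs).real
    return float(np.sum(roots ** p))

def df_dsm_formula(x, m, p):
    """Right-hand side of (22)."""
    n = len(x)
    total = 0.0
    for j in range(n):
        denom = np.prod([x[j] - x[i] for i in range(n) if i != j])
        total += x[j] ** (p + n - m - 1) / denom
    return (-1) ** (m + 1) * p * total

x = np.array([0.2, 0.3, 0.5])        # distinct, non-negative, summing to 1
p, eps = 1.5, 1e-6
coeffs = np.poly(x)                  # [1, -s_1, s_2, -s_3]: coefficient of x^(n-m) is (-1)^m s_m

results = {}
for m in range(2, len(x) + 1):
    sign = (-1) ** m
    up, down = coeffs.copy(), coeffs.copy()
    up[m] += sign * eps              # s_m -> s_m + eps, other s_k fixed
    down[m] -= sign * eps            # s_m -> s_m - eps
    numeric = (f_from_coeffs(up, p) - f_from_coeffs(down, p)) / (2 * eps)
    results[m] = (numeric, df_dsm_formula(x, m, p))
```

For this sample point each entry of `results` pairs a central-difference estimate with the value of (22); the two should agree to the accuracy of the finite difference, and both are negative, consistent with the monotonicity claimed in Lemma 3.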



To prove that $f(x)$ is a monotonically non-increasing function of $s_m$ for each $m = 2, 3, \ldots, n$,
we use some standard results from Numerical Analysis [?]. It is known that there is a unique
polynomial of degree at most $(n-1)$ which interpolates a given function $g(x)$ at the points $x_1, x_2, \ldots, x_n$.
The coefficient of $x^{n-1}$ of this polynomial is given by

$$\sum_{j=1}^{n} \frac{g(x_j)}{\prod_{i \ne j} (x_j - x_i)},$$

called the Newton Divided Difference [?] of the function $g(x)$. The expression on the RHS of
(22) implies that $\partial f / \partial s_m$ is the Newton Divided Difference of the following function:

$$g(x) \equiv g_m(x) = (-1)^{m+1}\, p\, x^{\,n-m+p-1}. \tag{23}$$

By the Hermite-Genocchi theorem [15], the Newton Divided Difference is also given by the
integral of $g^{(n-1)}(p_1 x_1 + \ldots + p_n x_n)$ over the probability simplex $\{(p_1, p_2, \ldots, p_n) : p_i \ge 0,\ \sum_{i=1}^{n} p_i = 1\}$,
where $g^{(n-1)}(x)$ denotes the $(n-1)$th derivative of $g(x)$.

From (23) we obtain

$$g^{(n-1)}(x) = (-1)^{m+1}\, p\, (p - m + 1)(p - m + 2) \cdots (p - m + n - 1)\, x^{\,p-m}. \tag{24}$$

It is easy to see that for all $2 \le m \le n$, $g^{(n-1)}(x) \le 0$ for all $x > 0$, since $1 < p < 2$: exactly
$m - 2$ of the factors $(p - m + k)$, $k = 1, \ldots, n-1$, are negative, so their product has sign $(-1)^m$,
which combines with the prefactor $(-1)^{m+1}$ to give a negative sign. Hence the
integral over the probability simplex is negative and we get $\partial f / \partial s_m \le 0$ as required.

Therefore, $\partial f / \partial s_m \le 0$ everywhere except on the manifolds on which two or more of the roots
$x_1, x_2, \ldots, x_n$ coincide. By continuity we deduce that $f$ is a monotonically non-increasing
function of the elementary symmetric polynomials $s_2, s_3, \ldots, s_n$ everywhere. $\Box$

Note that in Lemma 3 we considered $1 < p < 2$. For the case $p = 2$, proceeding analogously to
the proof of the above lemma, we find the following: in the absence of multiple roots

$$\frac{\partial f}{\partial s_2} < 0 \quad \text{whereas} \quad \frac{\partial f}{\partial s_m} = 0 \ \text{ for all } m = 3, 4, \ldots, n. \tag{25}$$

Hence by continuity, $f$ is a monotonically non-increasing function of the elementary symmetric
polynomial $s_2$ everywhere. The latter is however a Schur-concave function of the Schmidt
vector $\lambda$ [?]. Hence for the case $p = 2$ as well, Theorem 2 applies and the multiplicativity
stated in Theorem 1 holds.
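
For $p = 2$ the statement (25) can also be read off from the elementary identity $\sum_i x_i^2 = s_1^2 - 2 s_2$, so that $f = 1 - 2 s_2$ when $s_1 = 1$. The short sketch below (assuming NumPy; the sample point is an arbitrary choice) checks this identity numerically. It is offered only as an illustration of (25), not as part of the argument above.

```python
import numpy as np

x = np.array([0.1, 0.2, 0.3, 0.4])   # non-negative entries summing to 1
coeffs = np.poly(x)                  # [1, -s_1, s_2, -s_3, s_4]
s1, s2 = -coeffs[1], coeffs[2]

f2 = float(np.sum(x ** 2))           # f(x) for p = 2
identity_value = s1 ** 2 - 2 * s2    # depends on s_2 only, with slope -2
```

Since `identity_value` involves no $s_m$ with $m \ge 3$, the derivative with respect to $s_2$ is $-2$ and the derivatives with respect to $s_3, \ldots, s_n$ vanish.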
Appendix

A real-valued function $\Phi$ on $\mathbb{R}^n$ is said to be Schur convex (see [?]) if

$$x \prec y \implies \Phi(x) \le \Phi(y).$$

Here the symbol $x \prec y$ means that $x = (x_1, x_2, \ldots, x_n)$ is majorized by $y = (y_1, y_2, \ldots, y_n)$
in the following sense: Let $x^\downarrow$ be the vector obtained by rearranging the coordinates of $x$ in
decreasing order:

$$x^\downarrow = (x_1^\downarrow, x_2^\downarrow, \ldots, x_n^\downarrow) \quad \text{means} \quad x_1^\downarrow \ge x_2^\downarrow \ge \ldots \ge x_n^\downarrow.$$

For $x, y \in \mathbb{R}^n$, we say that $x$ is majorized by $y$ and write $x \prec y$ if

$$\sum_{j=1}^{k} x_j^\downarrow \le \sum_{j=1}^{k} y_j^\downarrow, \quad 1 \le k \le n,$$

and

$$\sum_{j=1}^{n} x_j^\downarrow = \sum_{j=1}^{n} y_j^\downarrow.$$

In the simplex $\Sigma_d$, defined by the constraints (8), the minimal point is $(1/d, \ldots, 1/d)$ (the
barycenter of $\Sigma_d$), and the maximal points are the permutations of $(1, 0, \ldots, 0)$ (the vertices).
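
The majorization relation defined above is straightforward to test numerically. The following sketch assumes NumPy; the function name `is_majorized` and the tolerance are hypothetical choices made for this example.

```python
import numpy as np

def is_majorized(x, y, tol=1e-12):
    """Return True if x is majorized by y (x ≺ y) in the sense of the Appendix."""
    xs = np.sort(np.asarray(x, dtype=float))[::-1]   # decreasing rearrangement of x
    ys = np.sort(np.asarray(y, dtype=float))[::-1]   # decreasing rearrangement of y
    if xs.shape != ys.shape or abs(xs.sum() - ys.sum()) > tol:
        return False                                 # total sums must coincide
    return bool(np.all(np.cumsum(xs) <= np.cumsum(ys) + tol))

d = 4
barycenter = np.full(d, 1.0 / d)
vertex = np.array([0.0, 1.0, 0.0, 0.0])
lam = np.array([0.5, 0.1, 0.3, 0.1])                 # a generic point of the simplex

checks = (
    is_majorized(barycenter, lam),   # the barycenter is the minimal point
    is_majorized(lam, vertex),       # every point is majorized by a vertex
    is_majorized(vertex, lam),       # the converse fails for a non-vertex point
)
```

For the sample point above, `checks` evaluates to `(True, True, False)`, matching the description of the minimal and maximal points of the simplex.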